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LEAFY COTYLEDON 1 GENES AND THEIR USES 

FIELD OF THE INVENTION 
5 The present invention is directed to plant genetic engineering. In particular, 

it relates to new embryo-specific genes useful in improving agronomically important 
plants. 

BACKGROUND OF THE INVENTION 
10 Embryogenesis in higher plants is a critical stage of the plant life cycle in which 

the primary organs are established. Embryo development can be separated into two main 
phases: the early phase in which the primary body organization of the embryo is laid down 
and the late phase which involves maturation, desiccation and dormancy. In the early phase, 
the symmetry of the embryo changes from radial to bilateral, giving rise to a hypocotyl with a 
1 5 shoot meristem surrounded by the two cotyledonary primordia at the apical pole and a root 
meristem at the basal pole. In the late phase, during maturation the embryo achieves its 
maximum size and the seed accumulates storage proteins and lipids. Maturation is ended by 
the desiccation stage in which the seed water content decreases rapidly and the embryo passes 
into metabolic quiescent state. Dormancy ends with seed germination, and development 
20 continues from the shoot and the root meristem regions. 

The precise regulatory mechanisms which control cell and organ differentiation 
during the initial phase of embryogenesis are largely unknown. The plant hormone abscisic 
acid (ABA) is thought to play a role during late embryogenesis, mainly in the maturation 
stage by inhibiting germination during embryogenesis (Black, M. (1991). In Abscisic Acid. 
25 Physiology and Biochemistry, W. J. Davies and H. G. Jones, eds. (Oxford. Bios Scientific 

Publishers Ltd.), pp. 99-124) Koornneef, M., and Karssen, C. M. (1994). In Arabidopsis, E. 
M. Meyerowitz and C. R. Sommerville, eds. (Cold Spring Harbor: Cold Spring Harbor 
Laboratory Press), pp. 3 1 3-334). Mutations which effect seed development and are ABA 
insensitive have been identified in Arabidopsis and maize. The ABA insensitive (abi3) mutant 
30 of Arabidopsis and the viviparousl (vpl) mutant of maize are detected mainly during late 

embryogenesis (McCarty, et al, (1989) Plant Cell 1, 523-532 and Parcy et al, (1994) Plant 
Cell 6, 1567-1582). Both the VP I gene and the ABI3 genes have been isolated and were 
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found to share conserved regions (Giraudat, J. (1995) Current Opinion in Cell Biology 
7:232-238 and McCarty, D. R. (1995). Annu. Rev, Plant Physiol. Plant MoL Biol. 
46:71-93). The VP I gene has been shown to function as a transcription activator 
(McCarty, et al. p (1991) Cell 66:895-906). It has been suggested that ABI3 has a similar 
function. 

Another class of embryo defective mutants involves three genes: LEAFY 
COTYLEDON! and 2 (LEC1, LEC2) and FUSCA3 (FUS3) . These genes are thought to 
play a central role in late embryogenesis (Baumlein, et al. (1994) Plant J. 6:379-387; 
Meinke, D. W. (1992) Science 258:1647-1650; Meinke et aL, Plant Cell 6:1049-1064; 
West et aL, (1994) Plant Cell 6:1731-1745). Like the abi3 mutant, leafy cotyledon-type 
mutants are defective in late embryogenesis. In these mutants, seed morphology is altered, 
the shoot meristem is activated early, storage proteins are lacking and developing 
cotyledons accumulate anthocyanin. As with abi3 mutants, they are desiccation intolerant 
and therefore die during late embryogenesis. Nevertheless, the immature mutants embryos 
can be rescued to give rise to mature and fertile plants. However, unlike abi3 when the 
immature mutants germinate they exhibit trichomes on the adaxial surface of the 
cotyledon. Trichomes are normally present only on leaves, stems and sepals, not 
cotyledons. Therefore, it is thought that the leafy cotyledon type genes have a role in 
specifying cotyledon identity during embryo development. 

Among the above mutants, the led mutant exhibits the most extreme 
phenotype during embryogenesis. For example, the maturation and postgermination 
programs are active simultaneously in the led mutant (West et al., 1994), suggesting a 
critical role for LEC1 in gene regulation during late embryogenesis. 

In spite of the recent progress in defining the genetic control of embryo 
development, further progress is required in the identification and analysis of genes 
expressed specifically in the embryo and seed. Characterization of such genes would 
allow for the genetic engineering plants with a variety of desirable traits. For instance, 
modulation of the expression of genes which control embryo development may be used to 
alter traits such as accumulation of storage proteins in leaves and cotyledons. 
Alternatively, promoters from embryo or seed-specific genes can be used to direct 
expression of desirable heterologous genes to the embryo or seed. The present invention 
addresses these and other needs. 
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SUMMARY OF THE INVENTION 
The present invention is based, in part, on the isolation and characterization of 
LECI genes. The invention provides isolated nucleic acid molecules comprising a LECI 
polynucleotide sequence, typically about 630 nucleotides in length, which specifically 
hybridizes to SEQ. ED. No. 1 under stringent conditions. TheL.EC/ polynucleotides of the 
invention can encode a LECI polypeptide of about 210 amino acids, typically as shown in 
SEQ. ID. No. 2. 

The nucleic acids of the invention may also comprise expression cassettes 
containing a plant promoter operably linked to the LECI polynucleotide. In some 
embodiments, the promoter is from a LECJ gene, for instance, as shown in SEQ. ID. No. 3. 
The LECI polynucleotide may be linked to the promoter in a sense or antisense orientation. 

The invention also provides transgenic plants comprising an expression 
cassette containing a plant promoter operably linked to a heterologous LECI 
polynucleotide. The LECJ may encode a LECI polypeptide or may be linked to the 
promoter in an antisense orientation. The plant promoter may be from any number of 
sources, including a LECI gene, such a as that shown in SEQ. ID. No. 3 or SEQ. ID. No. 4. 
The transgenic plant can be any desired plant but is often a member of the genus Brassica. 

Methods of modulating seed development in a plants are also provided. The 
methods comprise introducing into a plant an expression cassette containing a plant promoter 
operably linked to a heterologous LECJ polynucleotide. The LECJ may encode a LECI 
polypeptide or may be linked to the promoter in an antisense orientation. The expression 
cassette can be introduced into the plant by any number of means known in the art, including 
through a sexual cross. 

The invention further provides expression cassettes containing promoter 
sequences from LECI genes. The promoters of the invention can be characterized by their 
ability to specifically hybridizes to a polynucleotide sequence consisting of nucleotides 1 to - 
1998 of SEQ. ID. No. 3. The promoters of the invention can be operably linked to a variety 
of nucleic acids, whose expression is to be targeted to embryos or seeds. Transgenic plants 
comprising the expression cassettes are also provided. 

The promoters of the invention can be used in methods of targeting expression 
of a desired polynucleotide to seeds. The methods comprise introducing into a plant an 
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expression cassette containing a LEC1 promoter operably linked to a heterologous 
polynucleotide sequence. 

Definitions 

The phrase "nucleic acid" refers to a single or double-stranded polymer of 
deoxy ribonucleotide or ribonucleotide bases read from the 5' to the 3 1 end. Nucleic acids 
may also include modified nucleotides that permit correct read through by a polymerase 
and do not alter expression of a polypeptide encoded by that nucleic acid. 

The phrase "polynucleotide sequence" or "nucleic acid sequence" includes 
both the sense and antisense strands as either individual single strands or in the duplex. It 
includes, but is not limited to, self-replicating plasmids, chromosomal sequences, and 
infectious polymers of DNA or RNA. 

The phrase "nucleic acid sequence encoding" refers to a nucleic acid which 
directs the expression of a specific protein or peptide. The nucleic acid sequences include 
both the DNA strand sequence that is transcribed into RNA and the RNA sequence that is 
translated into protein. The nucleic acid sequences include both the full length nucleic 
acid sequences as well as non-full length sequences derived from the full length sequences. 
It should be further understood that the sequence includes the degenerate codons of the 
native sequence or sequences which may be introduced to provide codon preference in a 
specific host cell. 

The term "promoter" refers to a region or sequence determinants located 
upstream or downstream from the start of transcription and which are involved in 
recognition and binding of RNA polymerase and other proteins to initiate transcription. A 
"plant promoter" is a promoter capable of initiating transcription in plant cells. Such 
promoters need not be of plant origin, for example, promoters derived from plant viruses, 
such as the CaMV35S promoter, can be used in the present invention. 

The term "plant" includes whole plants, plant organs (e.g.., leaves, stems, 
flowers, roots, etc.), seeds and plant cells and progeny of same. The class of plants which 
can be used in the method of the invention is generally as broad as the class of higher 
plants amenable to transformation techniques, including both monocotyledonous and 
dicotyledonous plants, as well as certain lower plants such as algae. It includes plants of a 
variety of ploidy levels, including polyploid, diploid and haploid. 
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stages of development, while the single mutants developed into mature embryos, suggesting 
that these genes act early during development. 

Further examination of the early stages of the single and double mutations 
showed defects in the shape, size and cell division pattern of the mutants suspensors. The 
5 suspensor of wild type embryo consists of a single file of six to eight cells, whereas the 
suspensors of the mutants are often enlarged and undergo periclinal divisions. Leafy 
cotyledon mutants exhibit suspensor anomalies at the globular or transition stage whereas 
wild type and abi3 mutant do not show any abnormalities. 

The number of anomalous suspensors increases as the embryos continue to 
10 develop. At the torpedo stage, the wild type suspensor cells undergo programmed cell death, 
but in the mutants secondary embryos often develop from the abnormal suspensors and, when 
rescued, give rise to twins. 

The Organization of the LEC1 Locus in Wild Type Plants and led Mutants 

Two mutant alleles of the LEC1 gene have been reported, lecl-1 and led -2 

15 (Meinke, 1992; West et al., 1994). Both mutants were derived from a population of plants 
mutagenized insertionally with T-DNA (Feldmann and Marks, 1987), although lecl-1 is not 
tagged. The lecl-2 mutant contains multiple T-DNA insertions. A specific subset of T-DNA 
fragments were found to be closely linked with the mutation. A genomic library of lecl-2 
was screened using right and left borders T-DNA as probes. Genomic clones containing 

20 T-DNA fragments that cosegragate with the mutation were isolated and tested on southern 
blots of both wild type and lecl-1 plants. Only one clone hybridized with Arahidopsis DNA 
and also gave polymorphic restriction fragment in led -J. 

The lecl-1 polymorphism resulted from a small deletion, approximately 2 kb in 
length. Using sequences from the plant fragment flanking the T-DNA, the genomic wild type 

25 DNA clones and the lecl-1 genomic clones were isolated. An EcoRI fragment of 7.4 kb of 
the genomic wild type DNA that corresponded to the polymorphic restriction fragment in 
lecl-1 was further analyzed and sequenced. The exact site of the deletion in lecl-1 was 
identified using a PCR fragment that was generated by primers, within the expected borders 
of the deleted fragment, and sequencing. 

30 In the wild type genomic DNA that corresponded to the lecl-1 deletion, a 626 

bp ORF was identified. Southern analysis of wild type DNA and the two mutants DNA 
probed with the short DNA fragment of the ORF revealed that both the wild type and led -2 



WO 98/37 1 84 PCT/US98/02998 

23 

DNA contain the ORF while the lecl-1 genomic DNA did not hybridize. The exact insertion 
site of the T-DNA in led -2 mutant was determined by PCR and sequencing and it was found 
that the T-DNA was inserted 1 15 bp upstream of the ORFs translational initiation codon in 
the 5* region of the gene. 
5 At the site of the T-DNA insertion a small deletion of 21 plant nucleic acids 

and addition of 20 unknown nucleic acids occurred. These results suggest that in led -2 the 
T-DNA interferes with the regulation of the ORF while in led- J the whole gene is deleted. 
Thus, both led alleles contain DNA disruptions at the same locus, confirming the identity of 
the LECI locus. 

10 Ths leci Mutants Cm Pe Complement by Transformation 

To prove that the 7.4 kb genomic wild type fragment indeed contained the 
ORF of the LECI gene, we used a genomic fragment of 3395 bp (SEQ. ID. No. 3) within 
that fragment to transform homozygous lecl-1 and lecl-2 plants. The clone consists of a 
3395 bp BstYI restriction fragment containing the gene and the promoter region. The 

15 translation start codon (ATG) of the polypeptide is at 1999 and the stop codon is at 2625 
(TGA). There are no introns in the gene. 

The transformed plants were selected on hygromycin plates and were tested to 
contain the wild type DNA fragment by PCR analysis. Both transgenic mutants were able to 
produce viable progeny, that were desiccation tolerant and did not posses trichomes on their 

20 cotyledons. We concluded that the 3.4 kb fragment can complement the lecl mutation and 
since there is only one ORF in the deleted 2 kb fragment in lecl-1 we suggest that this ORF 
corresponds to the LECI gene. 
The LECI Gene is a Member of Gene Family 

In order to isolate the LECI gene two cDNA libraries of young siliques were 

25 screened using the 7.4 kb DNA fragment as a probe. Seventeen clones were isolated and after 
further analysis and partial sequencing they were all found to be identical to the genomic 
ORF. The cDNA contains 626 bp ORF specifying 208 amino acid protein (SEQ. ID. Nos. 1 
and 2). 

The LECI cDNA was used to hybridize a DNA gel blot containing Ws-O 
30 genomic DNA digested with three different restriction enzymes. Using low stringency 

hybridization we found that there is at least one more gene. This confirmed our finding of 
two more Arabidopsis ESTs that show homology to the LECI gene. 
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The LECl gene is Embryo Specific 

The led mutants are affected mostly during embryogenesis. Rescued mutants 
can give rise to homozygous plants that have no obvious abnormalities other than the 
presence of trichomes on their cotyledons and their production of defective progeny. 
5 Therefore, we expected the LECl gene to have a role mainly during embryogenesis and not 
during vegetative growth. To test this assumption Poly (A)+ RNA was isolated from siliques, 
seedling, roots, leaves, stems and buds of wild type plants and from siliques of lecl plants. 
Only one band was detected on northern blots using either the LECl gene as a probe or the 
7.4 kb genomic DNA fragment suggesting that there is only one gene in the genomic DNA 

10 fragment which is active transcriptionally. The transcript was detected only in siliques 

containing young and mature embryos and was not detected in seedlings, roots, leaves, stems 
and buds indicating that the LECl gene is indeed embryo specific. In addition, no RNA was 
detected in siliques of both alleles of lecl mutants confirming that this ORF corresponds to 
the LECl gene. 

15 Expression Pattern of the LECl Gene 

To study how the LECl gene specifies cotyledons identity, we analyzed its 
expression by in situ hybridization. We specifically focused on young developing embryos 
since the mutants abnormal suspensors phenotype indicates that the LECl gene should be 
active very early during development. 

20 During embryogenesis, the LECl transcript was first detected in proglobular 

embryos. The transcript was found in all cells of the proembryo and was also found in the 
suspensor and the endosperm. However, from the globular stage and on it accumulates more 
in the outer layer of the embryo, namely the protoderm and in the outer part of the ground 
meristem leaving the procambium without a signal. At the torpedo stage the signal was 

25 stronger in the cotyledons and the root meristem, and was more limited to the protoderm 
layer. At the bent cotyledon stage the signal was present throughout the embryo and at the 
last stage of development when the embryo is mature and filling the whole seed we could not 
detect the LECl transcript. This might be due to sensitivity limitation and may imply that if 
the LECl transcript is expressed at that stage it is not localized in the mature embryo, but 

30 rather spread throughout the embryo. 

The LECl gene encodes a Homolog of CCAAT binding factor. 
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WHAT IS CLAIMED IS : 

1 . An isolated nucleic acid molecule comprising a LECl polynucleotide 
sequence, which polynucleotide sequence specifically hybridizes to SEQ. ID. No. 1 under 

5 stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, wherein the LECl 
polynucleotide is between about 100 nucleotides and about 630 nucleotides in length. 

10 3. The isolated nucleic acid molecule of claim 1, wherein the LECl 

polynucleotide is SEQ. ID. No. 1. 

4. The isolated nucleic acid molecule of claim 1, wherein the LECl 
polynucleotide encodes a LECl polypeptide of between about 50 and about 210 amino acids. 

15 

5. The isolated nucleic acid molecule of claim 4, wherein the LECl 
polypeptide has an amino acid sequence as shown in SEQ. ID. No. 2. 

6. The isolated nucleic acid molecule of claim 1, further comprising a 
20 plant promoter operably linked to the LECl polynucleotide. 

7. The isolated nucleic acid molecule of claim 6, wherein the plant 
promoter is from a LECl gene. 

25 8. The isolated nucleic acid of claim 7, wherein the LECl gene is as 

shown in SEQ. ID. No. 3. 

9. The isolated nucleic acid of claim 7, wherein the LECl gene is as 
shown in SEQ. ID. No. 4. 

30 

10. The isolated nucleic acid of claim 7, wherein the LECl polynucleotide 
is linked to the promoter in an antisense orientation. 
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11. An isolated nucleic acid molecule comprising a LECl polynucleotide 
sequence, which polynucleotide sequence encodes LECl polypeptide of between about 50 
and about 210 amino acids. 

12. The isolated nucleic acid of claim 10, wherein the LECl polypeptide 
has an amino acid sequence as shown in SEQ. ID. No. 2. 

13. A transgenic plant comprising an expression cassette containing a plant 
promoter operably linked to a heterologous LECl polynucleotide that specifically hybridizes 
to SEQ. ID. No. 1 under stringent conditions. 

14. The transgenic plant of claim 12, wherein the heterologous LECl 
polynucleotide encodes a LEC 1 polypeptide. 

15. The transgenic plant of claim 13, wherein the LECl polypeptide is 
SEQ. ID. No. 2. 

16. The transgenic plant of claim 12, wherein the heterologous LECl 
polynucleotide is linked to the promoter in an antisense orientation. 

17. The transgenic plant of claim 12, wherein the plant promoter is from a 

LECl gene. 

18. The transgenic plant of claim 16, wherein the LECl gene is as shown 
in SEQ. ID. No. 3. 

19. The transgenic plant of claim 12, which is a member of the genus 

Brassica. 



20. A method of modulating seed development in a plant, the method 
comprising introducing into the plant an expression cassette containing a plant promoter 
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operably linked to a heterologous LEC1 polynucleotide that specifically hybridizes to SEQ. 
ID. No. 1 under stringent conditions. 

21 . The method of claim 19, wherein the heterologous LEC1 
polynucleotide encodes a LEC1 polypeptide. 

22. The method of claim 20, wherein the LEC1 polypeptide has an amino 
acid sequence as shown in SEQ. ID. No. 2. 

23. The method of claim 19, wherein the heterologous LECJ 
polynucleotide is linked to the promoter in an antisense orientation. 

24. The method of claim 19, wherein the heterologous LEC1 
polynucleotide is SEQ. ID. No. 1. 

25. The method of claim 19, wherein the plant promoter is from a LEC1 

gene. 

26. The method of claim 19, wherein the LEC1 gene is as shown in SEQ. 

ID. No. 3. 

27. The method of claim 19, wherein the plant is a member of the genus 

Brassica. 

28. The method of claim 19, wherein the expression cassette is introduced 
into the plant through a sexual cross. 



29. An isolated nucleic acid molecule comprising a plant promoter that 
specifically hybridizes to a polynucleotide sequence consisting of nucleotides 1 to 1998 of 
SEQ. ID. No. 3. 
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30. The isolated nucleic acid molecule of claim 28, wherein the plant 
promoter sequence consists essentially of nucleotides 1 to 1998 of SEQ. ID. No. 3. 

3 1 . The isolated nucleic acid molecule of claim 28, wherein the plant 
promoter sequence is a subsequence of SEQ. ID. No. 4. 

32. The isolated nucleic acid molecule of claim 28, further comprising a 
polynucleotide sequence operably linked to the plant promoter sequence. 

33 . The isolated nucleic acid of claim 30, wherein the polynucleotide 
sequence operably linked to the plant promoter sequence encodes a desired polypeptide. 

34. The isolated nucleic acid molecule of claim 28, wherein the 
polynucleotide sequence is linked to the promoter in an antisense orientation. 

35. A transgenic plant comprising an expression cassette containing a 
LEC1 promoter operably linked to a heterologous polynucleotide sequence, wherein the 
LEC1 promoter specifically hybridizes to SEQ. ID. No. 3 under stringent conditions. 

36. The transgenic plant of claim 33, wherein the polynucleotide sequence 
encodes a desired polypeptide. 



37. The transgenic plant of claim 33, wherein the heterologous 
polynucleotide sequence is linked to the LEC1 promoter in an antisense orientation. 

38. The transgenic plant of claim 33, wherein the LEC1 promoter is as 
shown in SEQ. ID. No. 3. 

39. The transgenic plant of claim 33, which is a member of the genus 

Brassica. 
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40. A method of targeting expression of a polynucleotide to a seed, the 
method comprising introducing into a plant an expression cassette containing a LEC1 
promoter operably linked to a heterologous polynucleotide sequence, wherein the LECI 
promoter specifically hybridizes to a polynucleotide sequence consisting of nucleotides 1 to - 
1998 of SEQ. ID. No. 3. 

4 1 . The method of claim 38, wherein the heterologous polynucleotide 
sequence encodes a desired polypeptide. 

42. The method of claim 38, wherein the heterologous polynucleotide 
sequence is linked to the promoter in an antisense orientation. 



